AITopics | stochastic expectation maximization

State-space models provide a powerful framework for describing the evolution of hidden states in dynamical systems [3], [4], [1]. Conventionally, state-space models assume Gaussian measurement and state noise, owing to their tractability and well-characterized statistical properties. However, many real-world phenomena are subject to perturbations that deviate from the conventional Gaussian noise assumption. In radio interferometry, for instance, observational data are frequently corrupted by non-Gaussian noise sources such as radio-frequency interference (RFI) [5], [2], which originates from man-made signals and introduces significant distortions into astronomical measurements [6], [30]. Such interference produces sporadic high-power spikes in the measured visibilities, leading to heavy-tailed statistics. Many radio-interferometric reconstruction methods assume Gaussian additive noise [7], [31], [33], [35], an approximation that can lead to inaccurate reconstructions when the heavy-tailed nature of real-world measurement noise is not properly accounted for. In the realm of state-space modeling, addressing non-Gaussian noise has led to the development of various methodological approaches, notably particle filtering and non-conventional Kalman filters. Particle filters [8], or Sequential Monte Carlo methods, are designed to handle non-linear and non-Gaussian state-space models by representing the posterior distribution with a set of weighted samples [9], [10], [32].

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

2606.23944

Country: Europe (0.14)

Genre: Research Report (0.82)

Industry: Media (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.89)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Neural Information Processing SystemsMar-16-2026, 23:31:35 GMT

Expectation-Maximization (EM) is a popular tool for learning latent variable models, but the vanilla batch EM does not scale to large data sets because the whole data set is needed at every E-step. Stochastic Expectation Maximization (sEM) reduces the cost of E-step by stochastic approximation. However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune. In this paper, we propose a variance reduced stochastic EM (sEM-vr) algorithm inspired by variance reduced stochastic gradient descent algorithms. We show that sEM-vr has the same exponential asymptotic convergence rate as batch EM. Moreover, sEM-vr only requires a constant step size to achieve this rate, which alleviates the burden of parameter tuning. We compare sEM-vr with batch EM, sEM and other algorithms on Gaussian mixture models and probabilistic latent semantic analysis, and sEM-vr converges significantly faster than these baselines.

artificial intelligence, machine learning, proceedings, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.61)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Neural Information Processing SystemsNov-20-2025, 22:47:44 GMT

Expectation-Maximization (EM) is a popular tool for learning latent variable models, but the vanilla batch EM does not scale to large data sets because the whole data set is needed at every E-step. Stochastic Expectation Maximization (sEM) reduces the cost of E-step by stochastic approximation. However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune. In this paper, we propose a variance reduced stochastic EM (sEM-vr) algorithm inspired by variance reduced stochastic gradient descent algorithms. We show that sEM-vr has the same exponential asymptotic convergence rate as batch EM. Moreover, sEM-vr only requires a constant step size to achieve this rate, which alleviates the burden of parameter tuning. We compare sEM-vr with batch EM, sEM and other algorithms on Gaussian mixture models and probabilistic latent semantic analysis, and sEM-vr converges significantly faster than these baselines.

name change, stochastic expectation maximization, variance reduction, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.61)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Neural Information Processing SystemsNov-20-2025, 19:11:51 GMT

However, sEM has a slow asymptotic convergence rate due to the high variance of each update.

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

South America > Paraguay > Asunción > Asunción (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada > Quebec > Montreal (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.99)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
(2 more...)

Add feedback

Reviews: Stochastic Expectation Maximization with Variance Reduction

Neural Information Processing SystemsOct-7-2024, 23:36:12 GMT

It builds on the classical stochastic EM of Cappé and Moulines (2009), with an extra variance reduction term in the iteration formula. This variance reduction technique is inspired from the stochastic gradient descent literature (in particular, algorithms developed in Le Roux et al 2012, Johnson and Zhang 2013, Defazio et al 2014). After setting up the background to present previous results in a unified and clear way, the authors present their algorithm and show two theoretical properties of it: a local convergence rate, and a global convergence property. In the last section, they compare their new algorithm to several state-of-the-art methods, on a Gaussian mixture toy example, and on a probabilistic latent semantic analysis problem.

algorithm, stochastic expectation maximization, variance reduction, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)

Add feedback

Geom-SPIDER-EM: Faster Variance Reduced Stochastic Expectation Maximization for Nonconvex Finite-Sum Optimization

Fort, Gersende, Moulines, Eric, Wai, Hoi-To

arXiv.org Machine LearningNov-24-2020

The Expectation Maximization (EM) algorithm is a key reference for inference in latent variable models; unfortunately, its computational cost is prohibitive in the large scale learning setting. In this paper, we propose an extension of the Stochastic Path-Integrated Differential EstimatoR EM (SPIDER-EM) and derive complexity bounds for this novel algorithm, designed to solve smooth nonconvex finite-sum optimization problems. We show that it reaches the same state of the art complexity bounds as SPIDER-EM; and provide conditions for a linear rate of convergence. Numerical results support our findings.

algorithm, approximation, expectation maximization, (11 more...)

arXiv.org Machine Learning

2011.12392

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)
Asia > Middle East > Jordan (0.04)
(4 more...)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Chen, Jianfei, Zhu, Jun, Teh, Yee Whye, Zhang, Tong

Neural Information Processing SystemsFeb-14-2020, 19:58:20 GMT

Expectation-Maximization (EM) is a popular tool for learning latent variable models, but the vanilla batch EM does not scale to large data sets because the whole data set is needed at every E-step. Stochastic Expectation Maximization (sEM) reduces the cost of E-step by stochastic approximation. However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune. In this paper, we propose a variance reduced stochastic EM (sEM-vr) algorithm inspired by variance reduced stochastic gradient descent algorithms. We show that sEM-vr has the same exponential asymptotic convergence rate as batch EM.

algorithm, stochastic expectation maximization, variance reduction, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.65)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Chen, Jianfei, Zhu, Jun, Teh, Yee Whye, Zhang, Tong

Neural Information Processing SystemsDec-31-2018

Expectation-Maximization (EM) is a popular tool for learning latent variable models, but the vanilla batch EM does not scale to large data sets because the whole data set is needed at every E-step. Stochastic Expectation Maximization (sEM) reduces the cost of E-step by stochastic approximation. However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune. In this paper, we propose a variance reduced stochastic EM (sEM-vr) algorithm inspired by variance reduced stochastic gradient descent algorithms. We show that sEM-vr has the same exponential asymptotic convergence rate as batch EM. Moreover, sEM-vr only requires a constant step size to achieve this rate, which alleviates the burden of parameter tuning. We compare sEM-vr with batch EM, sEM and other algorithms on Gaussian mixture models and probabilistic latent semantic analysis, and sEM-vr converges significantly faster than these baselines.

artificial intelligence, bayesian inference, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
Europe (0.28)

Add feedback

Stochastic Expectation Maximization with Variance Reduction

Chen, Jianfei, Zhu, Jun, Teh, Yee Whye, Zhang, Tong

Neural Information Processing SystemsDec-31-2018

Expectation-Maximization (EM) is a popular tool for learning latent variable models, but the vanilla batch EM does not scale to large data sets because the whole data set is needed at every E-step. Stochastic Expectation Maximization (sEM) reduces the cost of E-step by stochastic approximation. However, sEM has a slower asymptotic convergence rate than batch EM, and requires a decreasing sequence of step sizes, which is difficult to tune. In this paper, we propose a variance reduced stochastic EM (sEM-vr) algorithm inspired by variance reduced stochastic gradient descent algorithms. We show that sEM-vr has the same exponential asymptotic convergence rate as batch EM. Moreover, sEM-vr only requires a constant step size to achieve this rate, which alleviates the burden of parameter tuning. We compare sEM-vr with batch EM, sEM and other algorithms on Gaussian mixture models and probabilistic latent semantic analysis, and sEM-vr converges significantly faster than these baselines.

artificial intelligence, bayesian inference, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
Europe (0.28)

Add feedback

Filters

Collaborating Authors

stochastic expectation maximization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Stochastic Expectation Maximization for Robust State-Space Radio Interferometric Imaging

Stochastic Expectation Maximization with Variance Reduction

Stochastic Expectation Maximization with Variance Reduction

Stochastic Expectation Maximization with Variance Reduction

Reviews: Stochastic Expectation Maximization with Variance Reduction

Geom-SPIDER-EM: Faster Variance Reduced Stochastic Expectation Maximization for Nonconvex Finite-Sum Optimization

Stochastic Expectation Maximization with Variance Reduction

Stochastic Expectation Maximization with Variance Reduction

Stochastic Expectation Maximization with Variance Reduction